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immunoglobulin heavy chain; (b)Modifying the nucleotide sequence in the region of the nucleotide sequence encoding the C-termi- 
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Antibodies 

DescriptionThe invention relates to methods of improving the secretion of antibodies from 
the cells of organisms, such as plants and antibodies having heavy chains which are 
modified at the C-terminus ends in order to improve their secretion from a cell expressing 
the antibody. Modifications to the C-terminus ends of heavy chains of antibodies to allow 
binding of the heavy chain to J-chains are also included in the invention. 

Plants are an efficient and versatile system to express foreign proteins and can even 
produce complex, multimeric proteins in large amounts. One such example is the 
expression of a decameric, secretory immunoglobulin known as IgA , a molecule 
composed of two IgA units (containing 2 heavy and 2 light chains), a joining J chain and a 
secretory component It has been shown that transgenic tobacco cells are able to translocate 
all the IgA subunits in the endoplasmic reticulum (ER), where complex assembly occurs 
with high efficiency . The overall production of IgA is very high, making plants the system 
of choice for the expression of these molecules . 

US 6,417,429 and related International Patent Application No. WO 97/42313 (Scripps) 
disclose the expression of multimeric proteins, such as antibodies in plants. IgA is a 
secretory type of antibody. It contains a domain at the C-terminal end of each of its heavy 
chains (the Ca3 domain, see for example Figure 1 of the current application) which 
contains a cysteine residue which binds the peptide, known as the J chain. Naturally 
occurring IgA exists as two monomeric units (a unit is defined as a tetramer composed of 
two heavy chains and two light chains) of IgA antibodies joined by the J chain, in 
combination with a fourth component, a polypeptide known as the secretory component. 
This produces a complex which is resistant to degradation caused by proteases present in 
mucosal environments such as the oral cavity. 

The Ccc3 domain has been incorporated into, for example, the heavy chains of IgG antibody 
heavy chains (which cannot naturally bind J chains) to form IgA/G hybrids capable of 
binding J chains. This improves the stability of the antibody without altering antigen 
binding or specificity. 

Methods of expressing antibodies are discussed in detail in US 6,417,429 and WO 
97/42313. The United States patent discloses the prediction of plant cells containing 
nucleotide sequences encoding immunoglobulin heavy- and light-chain polypeptides, each 
polypeptide containing a leader sequence forming a secretion signal and immunoglobulin 
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molecules encoded by the nucleotide sequences where the leader sequences cleaved from 
the immunoglobulin molecules following pioteolitic processing in the plants. The method 
for producing such transgenic plants is achieved, for example, by introducing into the 
genome of a first member of the plant species a first mammalian nucleotide sequence 
encoding an immunoglobulin heavy chain containing polypeptide including a leader 
sequence forming a secretion signal to produce a first transformed cell. A second 
mammalian nucleotide sequence .encoding an immunoglobulin light chain containing 
polypeptide including a leader sequence forming a secretion signal is then inserted into the 
genome of a second plant species to produce a second transformant. The two transformed 
plants are then cross-pollinated to create progeny. Transgenic plant species are isolated 
from the progeny which are capable of producing an immunoglobulin molecule, where the 
leader sequence is cleaved from the immunoglobulin molecules by proteolytic processing 
to produce the completed antibody. The completed antibody typically comprises four 
polypeptide chains, two identical light chains (L) and two identical heavy chains (H). The 
four chains are held together by a combination of non-covalent interactions and covalent 
bonds. The molecule is typically composed of two identical halves in which both L and H 
chains contribute almost equally to the two identical antigen binding sites. 

IgM is another major class of antibody. This is secreted into the blood in the early stages 
of a primary antibody response. In the secreted form IgM is a pentamer comprised of five 
4-chain units and thus has a total of ten antigen-binding sites. Such pentamers contain a 
copy of a J chain polypeptide joining the C-terminus end of the heavy chains of two 
adjoining 4-chain units. 

The inventors have recently discovered that not all the assembled antibody, whose fate is 
secretion in mammals, is secreted by plant cells. Rather, a high proportion of the molecules 
are retained intracellularly and eventually delivered to vacuoles, where they are ultimately 
degraded . This transport and subsequent degradation in the vacuolar compartment can be 
inhibited by treatment with the fungal metabolite Brefeldin A and all IgA assembly 
intermediates -from the decamer to the simplest, heterotetrameric unit- share the same 
intracellular fate. In contrast, when the parent IgG molecule was expressed, it was 
efficiently secreted . As the kappa light chain is the same for both IgG and IgA/G 
molecules, this led the inventors to speculate that features of the IgA/G heavy chain might 
be responsible for this intracellular diversion. Alternatively, the nature of the heavy chain 
could impose stress on the endoplasmic reticulum, causing the subsequent missorting to the 
vacuole. 
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The Frigerio paper does suggest that the a-domains present in the hybrid IgA/G heavy 
chains might be involved in some way in intracellular retention and vacuolar delivery. 
However, the paper highlighted a wide variety of potential areas which looked as though 
they could be involved in intracellular retention and vacuolar delivery. 

Firstly, there was the possibility that the endoplasmic reticulum was stressed by the 
presence of the recombinant protein. There was the potential that this stress caused the 
endoplasmic reticulum to reject the protein in some way. The inventors have now proved 
that this is not the case, as the endoplasmic reticulum of these plants can cope with the 
co-expression of a further foreign protein without disposing of it. 

There was the further possibility that a-domains cause structural defects which lead to 
endoplasmic reticulum retention and eventual disposal in vacuole. The inventors have now 
demonstrated that this is not the case as the vacuolar fragments derived from assembled 
sub-units, and the immunoglobulin would not assemble if it had structural defects. 

The paper also suggested that cysteine residues were in some way involved. In mammalian 
cells, exposed cysteine residues in the C-terminal region of the heavy chains of IgA and 
secretory IgM, are recognised by endoplasmic reticulum quality control. The observations 
in the paper, that the parent IgG tetramers, which do not have free cysteines, are secreted 
with high efficiency, was thought to be consistent with the possibility that thiol-mediated 
retention also occurs in plant ER. This strongly suggested that cysteine residues on the 
IgA/G units were involved in the retention. This has now been proved by the inventors not 
to be the case as treatment with the reducing agents does not cause more secretioa The 
treatment of reducing agents in mammals has been previously observed to increase 
secretion of proteins. 

t 

The paper also suggested that the Ca2 domain was involved. The hybrid immunoglobulins 
contain extra Ca2 domains. The paper suggests that extra Ccc2 domains in the heavy chain 
might affect interactions with chaperones or recognition by quality control mechanisms in 
the endoplasmic reticulum. Again, the inventors have now identified that the C<x2 domain 
is not involved in intracellular retention and vacuolar delivery. 

The inventors therefore looked at the Ca3 domain. This is a domain of approximately 120 
amino acid residues. The inventors have now unexpectedly found that the last 18 amino 
acid residues at the C-terminus end are involved in intracellular retention and yacuolar 
delivery. Furthermore, they have identified that this particular region also has high levels 
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of homology with IgM and they are therefore likely to be involved in the retention of 
recombinant antibodies containing IgM-derived heavy chains. 

Furthermore, the inventors have realised that simply deleting the last 18 amino acids from 
the C-terminal end of such antibodies improves secretion, it also results in the removal of 
the residue required for binding of the antibody to the J-chain polypeptide. This removed 
the ability to generate multimeric antibody molecules. The inventors have therefore 
identified a method of reinstating the ability of the recombinant antibody to bind J-chain 
polypeptides. The latter method may also be used to introduce J-chain binding capability 
to non-IgA or IgM derived antibodies. 

Accordingly, the first aspect of the invention provides a method of making an antibody 
molecule, the antibody con tainin g a heavy chain comprising an cc3 domain or a mu domain, 
the method comprising: 

(a) providing a nucleotide sequence encoding the immunoglobulin heavy chain; 

(b) modifying the nucleotide sequence in the region of the nucleotide sequence 
encoding the last 18 amino acids of the completed heavy chain to remove, or reduce the 
effectiveness of, one or more targeting signal sequences to form a modified nucleotide 
sequence; 

(c) inserting the modified nucleotide sequence into a host cell; and 

(d) causing the host cell to express the modified nucleotide sequence to form the 
modified heavy chain and secrete the modified antibody heavy chain from the cell. 

The a3 domains are derived from immunoglobulins classed as immunoglobulin A (IgA 
chains). Mu domains are derived from immunoglobulin M (IgM heavy chains). The heavy 
chains may be from IgA or IgM heavy chains or may be hybrids of other classes of 
immunoglobulins which have had an ct.3 domain or a mu domain added to their heavy 
chain. Such hybrids include the known IgA/G hybrids. 

Typically, the heavy chain is co-expressed with a light chain within the cell and is secreted 
as a tetramer containing two heavy chains and two light chains. 

The modified antibody molecule has improved secretion compared with non-modified 
antibody. 
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The nucleotide sequence may be, for example, a DNA or RNA molecule encoding the 
immunoglobulin heavy chain. 

The nucleotide sequence modified comprises one or more of the nucleotides which encode 
the last 1 8 amino acids of the completed heavy chain. That is, from the C -terminal end of 
the heavy chain once the secreted antibody has been exported from the host cell and 
optionally further processed to form the completed antibody molecule. The completed 
antibody molecule may be in the form of separate heavy chains. More preferably, the 
completed antibody molecule consists of four polypeptide chains, two light chains and two 
heavy chains, the heavy chains of which comprise an cc3 domain or a mu domain which has 
been modified. Two or more of the four polypeptide units may be in turn joined together, 
for example to form a dimeric IgA molecule containing a J-chain or, for example, a 
pentameric IgM molecule which also comprises a J-chain. 

The modification of the nucleotide sequence may be via one or more mutations to alter the 
amino acid sequence for which the nucleotide sequence encodes. This may be via one or 
more point mutations, deletions, additions or replacement of one or more of the nucleotides 
with alternative nucleotides. Preferably nucleotides encoding 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 
15, 16, 18 or more of the C-tenninal amino acids are deleted as a result of modifying the 
nucleotide sequence encoding them. The alteration causes the amino acid sequence of 
targeting signal to be changed so that it is no longer recognised by the host cell, for 
example, as a vacuolar sorting signal. 

The review article by Matsuoka K. and Neuhaus J.M., (J. Exp. Bot. Vol. 50 (1999), pages 
165-174) suggests that the function of C-tenninal vacuolar sorting signals can be 
decreased, and even abolished, by adding two to four glycine residues downstream of the 
sorting signal. The inventors of the present application expect that addition of at least two, 
preferably two to four, glycine residues downstream of a C-tenninal sorting signal of an 
antibody will have a similar affect to deleting or mutating the sorting signal, that is: it will 
result in the decrease or abolition of vacuolar sorting of the antibody. Alternative ways of 
'masking' the C-terminal sorting signal are envisaged, e.g. other 'inocuous 1 amino acids, e.g. 
Asparagine or Alanine may be located downstream of the C-terminal sorting signal. Such 
C-terminal extensions can arise easily without affecting the proper folding and activity of 
proteins. 

Accordingly, preferably the region encoding the C-terminus is modified by the addition of 
nucleotides encoding 1 or more, typically 2 to 8, especially 2 to 4 amino acid residues, such 
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as glycine, asparagine or alanine. This modification may be downstream of the C-terminal 
sorting signal. 

Amino acids are encoded by codons of three nucleotides. The codons encoding each amino 
acid are conserved between organisms. Changing one or more nucleotides in a codon for a 
different nucleotide can change the amino acid encoded by the codon. 

The targeting signal preferably encodes one or more amino acids which inhibit the 
secretion of the heavy chain polypeptide from the cell. This may be a signal which inhibits 
secretion from the endoplasmic reticulum and/or from a vacuole within a cell. 

It is expected that such sequences are likely to affect secretion from the cells of organisms 
such as higher eukaryotic animals including mammals, insects and Crustacea, in addition to 
plant cells. Such organisms are thought to have sorting machinery that could potentially 
misinterpret amino acid sequences present on a foreign protein. 

Accordingly, preferably the host cell is a higher eukaryotic animal cell or a plant cell. Most 
preferably the plant cell is a dicotyledonous plant cell, that is from a flowering plant whose 
embryos have two seed halves or cotyledons. Alternatively, the plant may be a 
monocotyledonous plant, that is a flowering plant whose embryos have one cotyledon or 
seed leaf. 

OThe plant may also be a lower plant, that is any non-flowering plant including ferns, 
gymnosperms, conifers, horse tails, club mosses, liverworts, hornworts, mosses, red algae, 
brown algae, gametophytes, sporophytes of pteridophytes, and green algae. 

The modified nucleotide sequence may be inserted into the host cell by means of any 
techniques known in the art. For example, if the host cell is a plant, 
Agrobacterium-mediated plant transformation, protoplast transformation, gene transfer into 
pollen, injection into reproductive organs and injection into immature embryos may be 
used. Additionally, particle guns may be used to introduce the nucleotide material into 
plant cells. Examples of such methods are described in detail in, for example, US 
6,417,429. 

Preferably, the host cell is within a mature organism, such as a mature transgenic plant. 
Alternatively, the host cells may be grown in tissue culture, for example in the form of a 
callus or suspension of cells. 
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In the preferred method of making the cells expressing the antibody molecule, a first 
nucleotide sequence encoding the modified antibody heavy chain is inserted into the 
genome of a first plant to produce a first transformant A second nucleotide sequence 
including an immunoglobulin light chain portion-containing polypeptide is introduced into 
a second plant to produce a second transformant The two transformants are then 
cross-pollinated to produce a progeny population. The transgenic plants are then isolated 
from the progeny population, the transgenic plants producing an antibody molecule 
containing a heavy chain with the modified C-terminus. 

As already indicated, such methods of producing plants are discussed in, for example, US 
6,417,429. 

Preferably, the nucleotide sequence encoding the immunoglobulin heavy chain comprises a 
leader sequence forming a secretion signal. This allows the immunoglobulin heavy chain 
to be secreted from the cell. The nucleotide sequence encoding the immunoglobulin light 
chain would also preferably have a secretion signal to allow that light chain to be secreted. 
Such secretion sequences are normally sheared off after secretion of the proteins from the 
cell. 

The nucleotide sequences also preferably contain one or more control sequences, such as 
promoters and termination sequences, to allow the production of the heavy chain to be 
regulated. The promoters may themselves be inducible or constitutive. The promoters may 
also be temporally regulated, where the promoter is controlled depending on the time 
during development of the host cell. Additionally, or alternatively, the promoter may be 
spatially regulated, that is it is regulated depending on where the host cell is within an 
organism, for example in order to specifically express the heavy chain within the leaves or 
roots of a plant. 

Vectors including the modified nucleotide sequence are also included within the scope of 
the invention. Vectors are molecules which serve to transfer nucleotides of interest into a 
cell. Such vectors are well known in the art and will vary depending on the host cell. For 
example, viral vectors, such as adenoviral or baculoviral vectors may be used with 
mammalian or insect cells respectively. Alternatively, plasmids based upon the Ti plasmid 
from Agrobacterium tumefaciens may be used, for example, to introduce material into 
plant cells. 

Host cells and whole transgenic organisms containing the modified nucleotide sequences 
and/or vectors, are also included within the scope of the invention. 
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The nucleotide sequences modified to remove the targeting signal, that is, the nucleic acid 
sequence encoding the targeting signal in the final antibody heavy chain, may be modified 
by either deletion of that nucleotide sequence or the changing of the nucleic acid sequence 
so that it no longer encodes for that targeting sequence by the insertion, deletion or 
changing of one or more nucleic acids within the sequence. 

Preferably, the nucleotide sequence modified encodes the amino acid sequence: 
X1X2X3 V S X4 where X, =N, H or L 
X 2 = VorY 
X3 = SorN 

X4 = an aliphatic amino acid, especially V or L. 

Preferably, the amino acid sequence is N V S V S V. This has been identified by the 
inventors as being a targeting signal. 

Two other targeting signals have been identified within the C-terminus amino acid 
sequence of the heavy chain. The inventors have noted that leucine and isoleucine amino 
acids are present. In other molecules, these two amino acids are typically, sites of 
glycosylation and are thought to have an effect on the secretion of proteins from cells. 

Accordingly, one or more nucleic acids encoding isoleucine or leucine may be modified. 

One possible way of modifying the nucleotide sequence is simply deleting nucleotides so 
that two or more amino acids are deleted from the final modified antibody heavy chain. 
The inventors have realised that this is potentially disadvantageous. The cysteine residue 
which is usually found two amino acids from the C-terminus end of the heavy chain, is 
involved in the binding of J-chain polypeptides. J-chain polypeptides allow the formation 
of dimeric antibodies or pentemeric antibodies and improve the stability of the secreted 
antibodies. There is therefore a need to retain or replace the cysteine residue at this 
position. Accordingly, some of the nucleotides encoding the C-terminus 18 amino acids of 
the completed heavy chain may be deleted and replaced by a synthetic nucleotide sequence 
encoding an amino acid sequence and general formula: 

-(XaaOm C(Xaa 2 ) n 

where: C = a cysteine residue 

Xaai = independently any amino acid with the proviso that it is not 
selected from I, L or forms a consecutive sequence Xi X 2 X 3 V S X4 
where: Xi=N, HorL 
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X 2 = VorY 
X 3 = SorN 

X4 = aliphatic amino acid 



Xaa 2 



m 



independently any amino acid 
an integer of at least 2 
an integer of 0 to 5. 



Preferably m = 4-20, especially 5 to 10, and n = 0 or 1, especially 1 . Preferably Xaa t is not 
N V S V S V. Most preferably Xaa 2 is Y or A, especially A. 

Such a sequence may also be added to an existing heavy chain sequence to impart J-chain 
binding capability. 

A method of adding J-chain binding ability to the heavy chain of an antibody comprising 
the steps of: 

(a) providing a nucleotide encoding an immunoglobulin heavy chain; 

(b) adding to that nucleotide at the position of the nucleotide encoding the C-terminal 
end of the heavy chain, a nucleotide sequence encoding a synthetic tail with the 
amino acid sequence: 

-(Xaa,) m C(Xaa 2 )n 

which is defined as above; 

(c) expressing the completed nucleotide in a host cell to form immunoglobulin heavy 
chain capable of binding J-chain. 

Preferably, the tail is added at the position of the nucleotide codon encoding the C-terminal 
end of the final, processed, heavy chain. 

The amino acid sequence before the cysteine residue is preferably designed to ensure that 
the sequence is devoid of any secondary structure so that the cysteine residue is exposed to 
the medium surrounding the antibody to allow the cysteine residue to efficiently bind to 
J-chains. 



Preferably the modified amino acid is one or both of an isoleucine, 3 amino acids and/or 10 
amino acids from the C-terminus end of the completed heavy chain. 
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Preferably, the nucleotide sequence which is modified is contained within a nucleotide 
sequence encoding the sequence: 

PTX 1 X 2 X3VSX4X5X 6 X 7 X 8 X 9 X 1 oXhX 12 CY 
where: Xi = N, H or L, preferably N 
X 2 - V or Y, preferably V 
X 3 = SorN 

X4 = an aliphatic amino acid, preferably V or L 
X 5 = an aliphatic amino acid, preferably I, V or L 
Xe = M, V or L, especially M 
X 7 = SorA 
Xg = EorD 

X 9 = any amino acid, preferably G, V, A or T 

Xio= D, E, G or A, preferably D 

Xu = G or S, preferably G 

X, 2 = I,T,V,ZorA,preferablyIorT. 

Xn = may or may not be present and, where present is A or Y, preferably A. 

This is based upon a consensus sequence which has been derived from a study of the 
C-tenninus ends of heavy chains from IgA and IgM molecules (see Figure 8). 

The method of the invention additionally comprises the step of isolating and purifying the 
antibody molecules. The antibody molecules may be harvested and purified by methods 
known in the art. For example, if the host is a plant cell, then a portion of the transgenic 
plant may be homogenised to form a plant pulp and the antibody extracted from the pulp. 

The extracted antibody may be subjected to a protease digest, for example with papain or 
pepsin to form Fab or F(ab') 2 fragments. 

Antibodies obtainable by the methods of the invention are also provided. 

The method also provides an isolated antibody containing a heavy chain comprising an a3 
domain or a mu domain, the a3 domain or mu domain lacking one or more targeting 
signals towards the C4erminus end. Preferably, the targeting signal is not within the last 
1 8 amino acids of the C-terminus end of the heavy chain. 

Preferably, the antibodies of the invention do not contain the targeting signal: 
X 1 X 2 X 3 VSX 4 
where: X, - N,HorL 
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X 2 = VorY 
X 3 = SorN 

X4 = aliphatic amino acid 
Preferably, the targeting signal is N V S V S V. 



The antibody of the invention preferably contains one or no isoleucine or leucine amino 
acids within the last 1 8 amino acids of the C-terminus of the heavy chain of the antibody. 

Preferably, the antibody contains, at the C-terminus end of the heavy chain, the sequence; 
-(Xaai)mC(Xaa 2 )n 

where: C = cysteine residue 

Xaai = independently any amino acid with the proviso that it is not I or L 
or forms a consecutive sequence Xi X 2 X 3 V S X4 
where: Xj = N, H or L 
X 2 = VorY 
X 3 = SorN 

X4 = aliphatic amino acid 
Xaa 2 - independently any amino acid, especially Y or A 
m = an integer of at least 2 
n = an integer of 0 to 5, preferably 1. 

Antibodies capable of binding J-chain peptides are also 'provided, the antibodies 
comprising at their C-terminal end the sequence: 
-(Xaa,) ra C(Xaa 2 ) n 
where: C = cysteine residue 

Xaaj = independently any amino acid with the proviso that it is not I or L 
or forms a consecutive sequence Xi X 2 X 3 V S X4 
where: Xi = N, H or L 
X 2 = VorY 
X 3 = SorN 

X4 = aliphatic amino acid 
Xaa 2 = independently any amino acid, preferably Y or A 
m = at least 2 
n = 0to5, especially 1. 



Preferred synthetic tailpieces formed by such sequences include: 
(a) SCMVGHEALPMNFTQKTTDRLSG KPACY 
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(b) SCMVGHEAIPMNFTQKTTORI^GKPAAACY 

(c) SCMVGHEALPMNFTQKTroRLSGKPHASTPEPDPVACY 
and 

(d) SCMVGHEALPMNFTOKTIDRLSG KPAAAAACY 

Modifying the C-terminal end still further to include PAAAAACY at the C -terminus 
produced an improvement in antibody secretion. 

Further improvement is expected by removing the C-terminal tyrosine residue and 
optionally replacing it with Alanine. The terminal tyrosine is expected to be a potential 
vacuolar targeting signal. Frigerio, L., et aL (The Plant Cell, Vol. 10 (1998), pages 
1031-1042) looked at phaseolin, a legume protein in plants. Deletion of four C-terminal 
residues {Ala Phe Vol Tyr) was found to prevent vacuolar targeting. Unpublished results 
by the authors of this paper also found that at least 50% of the vacuolar targeting was due 
to the C-terminal tyrosine residue. 

It is also noted that other vacuolar sorting sequences comprising a C-terminus tyrosine 
residue have also been identified in other plant proteins such as albumin from rape and 
Arabidopsis (see Matsuoka K. and Neuhaus J.M., J. Exp. Bot., Vol. 50 (1999), pages 
165-174). 

Hence, more preferably the antibody comprises a C-terminus sequence selected from: 
-PAAAAACA and 
-PAAAAAC. 

Hence, preferably the antibody does not comprise a C-terminal tyrosine residue. 

The antibodies of the invention may be used in the manufacture of medicaments to treat 
disease. Alternatively, they may be administered to a patient to treat disease or 
administered as prophylaxis. 

The antibodies in the invention may also be used, for example, in the production of an 
assay or other techniques known to use antibodies. 

Oligonucleotides for use in the manufacture of the antibodies of the invention are also* 
provided. These include: 5'-ccatcgatggaatggacctgggttttt-3', 5'-ccctctagactagtagc 
ataggccatc-3 \ 5 ' -actgtagacaattccgccacctcagcctaca-3 5 ' -tgtaggctgaggtggcggaattgtctac 
agt-3 \ 5 ' -gagcagctcaacagcgttttccgctcagtcag-3 5 , 5 '-ctgactgagcggaaaacgctgttgagctgctc-3 ' , 



WO 2004/046190 



13 



PCT/GB2003/004983 



5 ' -ttgcccatgaacttcgtccagaagaccatcga-3 ' , 5 5 -tcgatggtcttctggacgaagttcatgggcaa-3 ' , 5'-aaaccc 
accaatgtcgctgtgtctgtgatcatg-3 5 , 5 5 -caigatcacagacacagcgacattggtgggttt-3 ' , 5'-aaacccaccaatg 
tcgctgtgtctgtgateatg-3 \ 5'-catgatcacagacacagcgacattggtgggttt-3', 5'-ccctctagactatttacccgac 
agacggtc-3', 5 5 -gagcag ctcaacagcgttttcxgctcagtcag-3 ' . 

The invention will now be described by way of example only, with reference to the 
following figures: 



Figure 1. Schematic representation of the constructs used in this study. 

All constructs were fused downstream of the CaMV35S promoter in the expression vector 
pJLH38. SP, signal peptide. The hearts indicate the glycosylation sites in the heavy chains. 

Figure 2. Assembly and intracellular fate of transiently expressed Ig chains. 

Tobacco mesophyll protoplasts were transfected with plasmids encoding k and a or A: and 
y/a chains, respectively. Cells were labelled for 16 hr with 35S methionine and cysteine. 
Total cell homogenates (c), purified vacuoles (v) or incubation media (m) were 
immunoprecipitated with anti gamma or anti IgG antisera. Polypeptides were visualised by 
SDS-PAGE and fluorography. 

Figure 3. Unassembled light chain is secreted as a monomer. 

A. Protoplasts were transfected with plasmids encoding k chain, y/a heavy chain or both 
chains together and labelled for 16 h. Cell homogenates and incubation media were 
immunoprecipitated with anti IgG and proteins resolved by reducing or non-reducing 
SDS-PAGE and fluorography. 

B. Protoplasts were transfected with plasmid encoding k chain and labelled for 16 h. The 
cell homogenate and incubation medium were subjected to sedimentation velocity 
centrifugation on a linear 5-25% sucrose gradient Gradient fractions were 
immunoprecipitated with anti IgG antiserum and polypeptides resolved by SDS-PAGE and 
fluorography.T, total (unfractionated) sample. Numbers at the bottom indicate molecular 
mass markers in kilodaltons. 

Figure 4. IgA/G travels through the Golgi complex. 

A. Protoplasts were transfected with plasmids encoding k chain and y/a heavy chain (wt), 
g/a lacking the two C- terminal glycosylation sites (A3,4) or y/a lacking all four 
glycosylation sites (Al ,2,3,4), respectively. Cells were pulse-labelled for 16 hours, 
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homogenized and immunoprecipitated with anti IgG antiserum. Proteins were visualised by 
SDS-PAGEand fluorography. 

B. Cells were transfected with plasmids encoding k and g/a chains and treated as in A. 
Immunoprecipitates were subjected to treatment with endoglycosydase H (endo H) or 
buffer (control) prior to SDS-PAGE analysis. The arrows indicate the position of vacuolar 
fragmentation products. The asterisk indicates the position of the k light chain. Numbers at 
left indicate molecular mass markers in kilodaltons. 

Figure 5. Vacuolar delivery of IgA/G is not caused by stress. 

Protoplasts from untransformed, wild type (SRI) or transgenic tobacco plants expressing 
IgA/G or IgG were transiently transfected with plasmids encoding phaseolin A418. Cells 
were labelled for lhr with 35S methionine and cysteine and chased for the indicated 
periods of time. Total cell homogenates were immunoprecipitated with anti phaseolin 
antiserum and polypeptides were visualised by SDS-PAGE and fluorography. The 
arrowhead indicates the position of intact phaseolin. The vertical bar in panel B indicates 
the position of vacuolar fragmentation products of phaseolin. Numbers at left indicate 
molecular mass markers in kilodaltons. 

Figure 6. Comparison of the C-terminal constant domains of IgA and IgG. 

Comparison of the C-terminal tailpieces of the constant domains of IgA/G and IgG heavy 
chains. Amino acid sequence alignment was generated using the MegAlign program 
(DNASTAR Inc., Madison, USA). Vertical bars identify identical amino acid residues The 
putative N-glycosylation sites in y/a are italicised The J-chain binding cysteine residue is 
underlined. 

Figure 7. Deletion of the Ccc3 tailpiece results in secretion of IgA/G. 

A. Tobacco protoplasts weere transfected with plasmids encoding k chain and y/a, y or 
y/aAC18 heavy chains, respectively. Cells were pulse labelled for 1 h and chased for the 
indicated periods of time. Cell homogenates and incubation media were 
immunoprecipitated with anti IgG antiserum. Proteins were visualised by SDS-PAGE and 
fluorography. Numbers at left indicate molecular mass markers in kilodaltons. White 
arrows indicate vacuolar fragmentation products. 

B. The fluorograms shown in A were subjected to densitometry to quantify the amount of 
secreted heavy chains, which accurately reflects the secretion of assembled antibody. 
Secreted heavy chains are expressed as percentage of total intracellular heavy chains 
immunoselected at 0 h chase. 

Figure 8. Sequence homology of the C-terminus ends of IgA and IgM heavy chains. 
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Figure 9. An artificial C-terminal tail PAAAAACY allows J chain binding and 
dlgA/G ennamer assembly. 

Tobacco protoplasts were transfected with plasmids encoding k chain and y/a, y/aAC18 or 
y/aAC18P(A) 5 CY heavy chains, respectively, either in the presence or in the absence of 
plasmid encoding the J chain. Cells were pulse labelled for 1 L Cell homogenates were 
immunoprecipitated with anti IgG antiserum. To analyse their assembly state, proteins 
were resolved by non-reducing SDS-PAGE and visualised by fluorography. Note that the 
efficiency of ennamer assembly of P(A) 5 CY is comparable to the wild-type IgA/G. AC 18 
is incapable of binding the J chain as it lacks the C-terminal cysteine. 

Figure 10. The artificial P(A) 5 CY tail allows for improved antibody secretion. 

Tobacco protoplasts were transfected with plasmids encoding k chain, J chain and y/a, 
y/aAC18 or y/aAC18P(A) 5 CY heavy chains. Cells were pulse labelled for 1 h and chased 
for the indicated periods of time. Cell homogenates and incubation media were 
immunoprecipitated with anti IgG antiserum. Proteins were visualised by reducing 
SDS-PAGE and fluorography. The fluorograms were subjected to densitometry to quantify 
the amount of secreted heavy chains. Secreted heavy chains are expressed as percentage of 
total intracellular heavy chains immunoselected at 0 h chase. Note that at 8 hours, recovery 
of P(A) 5 CY in the medium is 2.3-fold higher than recovery of IgA/G. 

MATERIALS AND METHODS 
Transgenic plants 

Transgenic Nicotiana tabacum cv Xanthi plant lines expressing assembled IgG and IgA/G 
under the control of the cauliflower mosaic virus 35S-promoter have previously been 
described . For protoplast transfection experiments, wild-type N. tabacum cv Petit Havana 
SRI was used Plants were grown in axenic conditions under a 12 hour light-dark regime. 

Recombinant DNA 

All DNA manipulations were performed using established procedures. 
The full length IgA/G g/a heavy chain was amplified from the binary vector pMON530 
using the polymerase chain reaction. The oligonucleotides 5'-ccatcgatggaatggacctgggttttt-3' 
and S'-ccctctagactagtagcataggccatc-S' containing Clal and Xbal restriction sites before the 
start codon and after the stop codon for cloning purposes, were used. The digested PGR 
products were ligated into a pUC-based vector downstream of the CaMV35S-promoter 
(Denecke et al., 1992). The resulting plasmid was designated pJLH38. 
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The glycosylation site mutations were produced using the 'Quickchange 1 in vitro 
mutagenesis system (Stratagene, La Jolla, CA). Potential glycan sites mutations Ser 76 to 
Ala (Dglycanl pJLH40) 5 Thr 289 to Val (Dglycan2 pJLH41), Thr 526 to Ala (Dglycan3 
pJLH42) and Ser 541 to Ala (Dglycan4 pJLH43) were introduced using the 



5 ' -tcgatggtcttctggacgaagttcatgggcaa-3 1 , 5'-aaacccaccaatgtcgctgtgtctgtgatcatg-3' or 
5'-catgatcacagacacagcgacatt ggtgggttt-3' respectively, using pJLH38 as a template. 
Multiple glycan mutants D3,4 was also produced using Quickchange in vitro mutagenesis 
system (Stratagene, La Jolla, CA) with the oligonucleotides 
5'-aaacccaccaatgtcgctgtgtctgtgatcatg-3' and 5'-catgatcacagacacagcgacatt ggtgggttt-3' and 
pJLH42 as template. The glycan mutant pJLH45, containing no glycosylation sites 
(1)1,2,3,4) was produced by isolating the Clal-Ncol fragment from pJLH40, the 
NcoI-EcoRI fragment pJLH41, EcoRI-Xbal fragment from pJLH44 and ligating them into 
the pUC vector previously cut with Clal and Xbal. 

Removal of the last 18 amino acids of the g/a heavy chain (DC18 pJLH47) was achieved 
by PGR using the anti-sense oligonucleotide 5'-ccctctagactatttacccgacagacggtc-3' 
producing a stop codon followed by an Xbal site at position 537 with the sense oligo 
5'-gagcag ctcaacagcgttttccgctcagtcag-3\ The resulting PCR product was cut with EcoRI 
and Xbal and ligated into the expression vector cut with Clal and Xbal along with a 
Clal-EcoRI fragment of pJLH38. 

Phaseolin expression constructs T343F and D418 are described in Pedrazzini et al. (1997) 
and Frigerio et al. (1998), respectively. 

Protoplast transfection 

Protoplasts prepared from axenic leaves of tobacco (Nicotiana tabacum cv Petit Havana 
SRI) grown in sterile in vitro conditions under a 12 hour light- dark regime, were subjected 
to polyethylene glycol-mediated transfections as described by Predrazzini et al. (1997). 
40mg of each plasmid was used to transform 106 protoplasts in 1 ml. When only single 
antibody chains were expressed, the total amount of DNA was maintained constant among 
samples by adding 40mg of empty expression vector pDHA . After transfection, cells were 
incubated at 25°C before metabolic labelling. 

In vivo labelling of protoplasts and analysis of expressed polypeptides. 



oligonucleotides 

5 5 -tgtaggctgaggtggcggaattgtctacagt- 3 ' , 
5 ' -ctgactgagcggaaaacgctgttgagctgctc-3 * , 



5 ' -actgtagacaattccgccacctcagcctoca-3 ' , 
5 ' -gagcagctcaacagcgttttccgctcagtcag-3 ' , 
5 '-ttgcccatgaacttcgtccagaagaccatcga-3 5 , 



Pulse-chase experiments were conducted by labelling protoplasts using ProMix (a mixture 
of 35S-Met and 35S-Cys; Amersham) and chasing with an excess of cold amino acids for 
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the times stated (Predrazzini et al. 1997). After harvesting at desired time points, 
protoplasts and incubation media were frozen and homogenised by adding two volumes of 
ice-cold homogenisation buffer (150mM Tric-HCl, 150mM NaCl, 1.5mM EDTA, and 
1 .5% (w/v) Triton X-100, pH 7.5) supplemented with Complete protease inhibitor cocktail 
(Roche). Immunoprecipitation of expressed polypeptides was performed as described 
previously (Frigerio et al. 1998) using rabbit polyclonal antisera raised against mouse IgG 
(whole molecule, Sigma), g heavy chain, k light chain (Southern Biotechnology), or bean 
phaseolin (Pedrazzini et al. 1997). Digestion of immunoprecipitated proteins proteins with 
endoglycosidase H (Roche) was performed as described previously (Ceriotti et al. 1991). 
For the analysis of the secreted free kappa light chains by sedimentation velocity, after 
radioactive labelling and homogenation cells, homogenates and incubation media, were 
loaded on top of a continuous 5 to 25% (w/v) linear sucrose gradient made in 150 mM 
NaCl, 1 mM EDTA, 0.1% Triton X-100, and 50 mM Tris-Cl, pH7.5. Samples were 
centrifuged at 39,000 rpm in a SW40 Ti rotor (Beckman Instruments, Inc., Fulierton, CA) 
for 20 hours at 20°C. K light chain was then immunoselected from each gradient fraction, 
hnmunoselected proteins were resolved by 15% (w/v) non-reducing or reducing 
SDS-PAGE and revealed by fluorography. 

RESULTS 

The inventors have previously shown that the intracellular fate of decameric, secretory 
IgA/G is the same as its constituent tetrameric IgA/G sub-unit . They therefore decided to 
focus their efforts on the latter for simplicity of analysis. Tetrameric IgA/G consists of two 
k light chains and two hybrid IgA/G chains (Fig. 1). The hybrid heavy chain (denominated 
y/a ) consists of the IgG y variable domain and constant Cyl and Cy2 domains from 
monoclonal IgG Guy's 13-, fused to the constant Ca2 and Ca3 domain from a secretory 
IgA . The Ca3 domain contains the C-terminal cysteine that is responsible for binding the J 
chain and contains regions necessary for contact with the secretory component . The 
additional Coc2 domain was initially added to provide an extra affinity tag, to facilitate 
purification of the antibody from plant tissue . The presence or the absence of this domain 
does not affect the trafficking of the molecule , and the inventors 1 unpublished 
observations. 

In order to demonstrate that the fate of IgA/G molecules from transgenic plants differs 
from that of the parent IgG molecules, protoplasts obtained from transgenic plants 
expressing either IgG or IgA/G were subjected to pulse labelling for 16 hr with 35 S 
methionine and cysteine and subsequent vacuole purification. Total cell homogenates, 
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purified vacuoles or incubation media were immunoprecipitated with, anti a antiserum. The 
unbound proteins were then re-inununoprecipitated with anti whole molecule IgG 
antiserum and polypeptides were revealed by SDS-PAGE and fluorography (Fig 2A). The 
comparison of IgG and IgA/G shows that whilst IgG heavy and light chains are mainly 
recovered in the protoplast incubation medium after 16 hr (Fig. 2, lane 3), IgA/G is mostly 
found inside the cells and partially in purified vacuoles (lanes 4 and 5). Anti y recognises 
only the constant gamma domains in the heavy chains. Fig. 2A shows that this antiserum 
coselects both heavy chains (y for IgG and y/a for IgA/G) and k light chains, indicating that 
the light chains have assembled with the parental and recombinant heavy chains. 
Re-immunoprecipitation with anti IgG, that recognises the whole IgG molecule (and 
therefore most of the IgA/G molecule as well, excluding the alpha constant domains in y/a) 
reveals the presence of additional, smaller IgA/G polypeptides in the total cell homogenates 
(Fig. 2 A, lanes 10 and 11). These polypeptides are found predominantly in the vacuolar 
fraction and represent degradation products . This provides confirmation of their previous 
findings that IgA/G assembles efficiently but is secreted very slowly, with a conspicuous 
proportion of the molecules being delivered to the vacuole where they are ultimately 
degraded . 

To understand the relative contribution of the heavy and light chains to the intracellular fate 
of IgA/G, the inventors decided to use transient expression in tobacco protoplasts. This 
technique has been successfully used to study the behaviour and trafficking of proteins in 
the plant secretory pathway, although it has never been used for complex multimeric 
proteins. The transient expression system allows rapid testing of mutant constructs and 
would therefore be a valuable tool for the analysis of potential of trafficking mutants of 
IgA/G. In order to reliably use this technique in this study the inventors wanted to prove 
that the behaviour of the Ig chains was comparable in transiently and permanently 
transformed tobacco cells. To achieve these aims they cloned the Ig coding sequences 
(heavy and light chains, Fig. 1), fused to the CaMV35S promoter, into a CaMV35S -driven 
vector for transient expression. Tobacco mesophyll protoplasts were transfected with 
plasmids encoding k chain and IgA/G y/a heavy chain. Transfected cells or cells obtained 
from transgenic IgA/G plants were metabolically labelled for 2 hr with 35S methionine and 
cysteine and homogenised. Homogenates were immunoprecipitated with either anti y or 
anti k antisera. As previously mentioned, anti y recognises only the constant y domains in 
the heavy chains, whereas anti k binds to the light chain only. Both of these antisera were 
capable of co-selecting both heavy and li^bt chains, indicating assembly of the IgA 
molecule after transient and permanent expression (figure 2B, lanes 1-4). Assembly is very 
efficient, as further confirmed by the fact that the molecules migrate with the mobility 
expected for fully assembled heterotetramers when resolved on non-reducing SDS-PAGE 
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(Fig 2B, lanes 5-8) with no free heavy chain detected in either transient of stably expressing 
protoplasts. An excess of free light chain can be observed in lane 8 and will be discussed 
further below. The discrepancy observed in the mobility of the k chain between transient 
and stable expression is due to the fact that a different signal peptide was used to prepare 
the transient expression construct compared to the murine signal peptide used to generate 
transgenic plants and see Methods) As the original sequence contains additional residues 
between the predicted signal peptide cleavage site and the first codon of mature k chain , 
this probably results in a slightly different electrophoretic mobility. 

The inventors' observations in transgenic plants reveal that the IgA/G tetramer is partially 
delivered to vacuoles. This results in the appearance of degradation products after a long 
chase (Fig 2A) . To test whether this is also the case for transiently expressed IgA/G 5 they 
compared the phenotype of the molecules after a 16 hr metabolic labelling (Fig. 2C). 
Subsequent immunoprecipitation with anti IgG, which recognises the whole IgG molecule, 
reveals the presence of small polypeptides resulting from degradation (figure 2C) when the 
IgA/G molecule is expressed transiently. The sizes of these fragments are comparable in 
both transiently and stably expressed antibody chains. They therefore conclude that 
fragments observed in transient expression are also the result of vacuolar delivery. 
Therefore, both the assembly and intracellular transport events of IgA/G are faithfully 
reproduced in transient expression using tobacco mesophyl protoplasts. 

From Fig. 2B (lane 8) and their previous observations in transgenic plants it appears that in 
cells expressing heavy and light chains (or other chains in addition to them), an excess of 
free, unassembled light chain can be detected. This also seems to be the case with 
transiently expressed chains. This excess is reflected by an increased amount of light chain 
in the medium, suggesting that the free light chain be secreted. To test this, the inventors 
transiently transformed tobacco protoplasts with plasmid encoding the k chain, the y/a 
heavy chain, or both chains together and subjected them to metabolic labelling for 16 
hours. Protoplasts were then homogenised and immunoprecipitated with anti IgG. Fig. 3 A 
shows that immunoprecipitation with anti IgG antiserum reveals the presence of a larger 
amount of k light chains compared to heavy chain in both the cells and the incubation 
medium. When the samples are resolved on non-reducing SDS-PAGE (Fig. 3A, lanes 7-12) 
it appears evident that the excess k light chain observed is free k light chain not associated 
with heavy chain. In mammalian cells there is some evidence that free unassembled light 
chain can be secreted as covalent or non-covalent dimer . From the experiment in 
non-reducing conditions (Fig. 3 A, lanes 10 and 12) it is clear that there are no 
disulfide-linlced dimers of k. To determine if non-covalent dimerisation occurs prior to 
secretion of k chain in plants, they transfected protoplasts with k chain and metabolically 
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labelled them overnight They then loaded cell homogenates and incubation medium onto 
linear 5-25% sucrose gradients and subjected them to sedimentation velocity 
centrifugation, along with molecular mass markers. Gradient fractions were then 
immunoprecipitated with anti IgG antiserum (Fig. 3B). Clearly, the vast majority of 
secreted light chain is retrieved in fractions of mass around 30 kDa, a size compatible with 
the monomeric form. It was concluded that e is secreted as a monomer in tobacco 
mesophyll protoplasts. 

In previous studies the inventors have shown that transport of IgA/G to the vacuole can be 
inhibited by the fungal metabolite Brefeldin A . They used that evidence to conclude that 
IgA/G was transported through the Golgi complex. Recently, however, it has become clear 
that there exist routes from the plant ER to the vacuole, which, although prone to BFA 
inhibition, seem to bypass the Golgi . In the light of this new evidence it is clear that BFA 
inhibition alone can no longer be used as a sole evidence for transport through the Golgi 
complex. Thus, in order to test whether IgA/G does indeed travel through the Golgi they 
have studied the state of one of its N-linked glycans. 

One prominent IgA/G vacuolar degradation product migrates above the k chain around 30 
kDa. This fragment is comprised of the variable and constant gamma domains and in 
non-reducing conditions is associated with the k light chain to yield the Fab fragment (data 
not shown). When all glycosylation sites in yla are abolished (Dl,2,3,4, Fig. 4A), the 30 
kDa fragment seems to disappear (Figure 4A, lane 3). The same phenotype is observed 
when wild-type heavy chain is expressed in the presence of tunicamycin, an inhibitor of 
N-linked glycosylation (Fig. 4B, compare lanes 1-2 with lanes 3-4). The observed 
disappearance of such fragment upon tunicamycin treatment or mutation of the 
glycosylation sites is likely due to the fact that the mobility of the becomes very similar to 
that of the k chain, resulting in co-migration of the two polypeptides. The fact that the 
mobility shift is only detectable in the mutant y/a chain where all four putative 
glycosylation sites have been removed (figure 4A^ lane 3) but not in the mutant chain 
carrying mutations in the two sites located in the a domain (D3,4, figure 4 A, lane 2) 
provides further evidence that the 30 kDa fragment originates from the y domain of the 
hybrid y/a heavy chain. 

As fragmentation occurs in the vacuolar compartment it was expected that the glycan 
present on the degradation product carry a complex (i.e. Golgi-modified) glycan if the 
molecule has travelled through the Golgi complex en route to the vacuole. If this is the 
case, this glycan should be resistant to endoglycosidase H (endo H) digestion. Endo H 
resistance is a reliable indication that a glycoprotein has travelled through the medial and 
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trans-Golgi complex . Indeed, treatment with endo H after transient expression, metabolic 
labelling and subsequent immunoprecipitation, does not affect the mobility of the 30 kDa 
fragment (Fig. 4B, compare lanes 2 and 6) indicating that this glycan has acquired endo H 
resistance. It is however clear that the enzyme is active, as the mobility of the heavy chain 
is increased by endo H treatment at the end of the pulse, similarly to what observed in the 
presence of tunicamycin (Fig. 4B, compare lanes 1, 3 and 5). This is expected as, shortly 
after synthesis, whilst still in the ER, the heavy chain glycans are in the high-mannose, 
endo H -sensitive form. 

Based on the results shown, it was therefore concluded that transport of IgA/G to the 
vacuole occurs through the Golgi complex. 

Is the observed vacuolar delivery of a proportion of SIgA/G the result of quality control or, 
rather, the result of stress imposed to the secretory system by the over-expression of this 
heterologous protein? Could vacuolar targeting be the effect of saturation of secretion? 
This has never been reported, although the opposite can occur: over-expression of vacuolar 
proteins can lead to their partial secretion [Frigerio, 1998]. To test this possibility, cells 
from plants expressing the parent IgG or SIgA/G were transfected with plasmid encoding a 
secreted form of the storage protein phaseolin, A418 . This phaseolin mutant is normally 
secreted very efficiently after visiting the Golgi complex. If expression of the antibody is 
saturating the ER capacity and imposing stress onto the system to the point of reducing 
secretion, it would be expected that secretion of A418 would be inhibited as well. 
However, Fig. 5 shows that in the antibody-expressing cells, the fate of phaseolin is not 
affected and the protein is still secreted with the same efficiency observed in wild-type 
cells. It is therefore concluded that vacuolar delivery of IgA cannot be attributed to stress 
to the endomembrane system. 

To further explore the possibility of a spurious effect due to stress on the secretory system, 
the inventors wanted to test whether expression of IgA/G was affecting transport of a 
normally vacuolar protein. They used the well studied phaseolin variant T343F, which is 
transported to the vacuole in tobacco protoplasts. Phaseolin undergoes proteolytic 
fragmentation in the vacuole, so the appearance of fragments is indicative of vacuolar 
delivery . In the antibody-expressing cells, phaseolin still delivered to vacuoles, with 
efficiency comparable to wild-type cells. A proportion of phaseolin molecules is secreted in 
the incubation medium. They have previously shown that this is due to saturation of the 
sorting machinery during transient expression In IgA/G expressing cells, there is no shift 
in the ration vacuolar/secreted phaseolin. This indicates that expression -and vacuolar 



WO 2004/046190 



22 



PCT/GB2003/004983 



delivery- of IgA/G is not competing with vacuolar sorting of phaseolin, suggesting that the 
two proteins may be sorted to the vacuole by different mechanisms. 

The work presented so far shows that the IgA/G tetrameric molecule is transported to the 
vacuolar compartment via the Golgi complex where it is ultimately degraded As this 
intracellular fate differs dramatically to the fete of the parental IgG molecule and the 
inventors have ruled out the possibility of trafficking events occurring as a result of ER 
stress, the most likely explanation for IgA/G delivery to the vacuole is the existence of 
positive, albeit cryptic, sorting information within the molecule. Clearly, this signal must 
be absent from the parent IgG as this is secreted efficiently . y and y/a heavy chains differ 
in that y/a has two additional, constant a domains. The inventors believe that the signal 
must be exposed and they therefore compared the C-terminal regions of the two heavy 
chains. Sequence alignments of the IgG y3 domain and the IgA/G a3 domains reveals that 
the <x3 domain is 18 residues longer than the y3 (Fig. 6). This extension constitutes the 
tailpiece containing the cysteine residue required for J chain binding . This tailpiece is rich 
in hydrophobic residues, a feature typical of C-terminal vacuolar sorting signals (ctVSS) . 
Given the heterogeneous nature of ctVSS, the inventors believe that the tailpiece is 
recognised as a vacuolar targeting signal within plant cells. To test this hypothesis, they 
deleted the 18-residue C-terminal tail to produce a truncated heavy y/a chain, denominated 
AC1 8. They co-transfected protoplasts with plasmids encoding the k light chain and y/a , y 
or AC18 heavy chains, respectively, and subjected them to pulse-chase analysis. When 
co-expressed with light chain, AC 18 assembled correctly (not shown) and was secreted 
with kinetics comparable to IgG (Fig. 6A). Note that whilst the low extracellular amount of 
IgA/G chains remains constant, and is probably the result of cell breakage during sample 
handling before homogenisation, the amount of secreted IgA/G containing DC 18 heavy 
chains increases steadily with time, with kinetics comparable to the secretion of the IgG 
chains (Fig. 7, compare lanes 13-16 with lanes 21-24). The increased secretion of DC18 
was paralleled by a decrease in the appearance of vacuolar degradation fragments, as 
revealed by overexposing immunoprecipitates after a long labelling (Fig. 6B). This shows 
that deletion of the C-terminal 18 amino acids of the IgA/G heavy chain results in reduced 
vacuolar targeting and increased secretion of the assembled tetramers. Therefore the 
inventors concluded that the C-terminal tail of the hybrid y/a chain is responsible for 
vacuolar delivery of IgA/G . 
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DISCUSSION 

Plants are unique among non-animal systems in their ability to assemble complex 
multimeric proteins. In this work the inventors have studied the intracellular fate of a 
hybrid secretory immunoglobulin IgA/G and gained further insight on how plants respond 
to the expression of a complex heterologous protein. The efficiency of IgA/G assembly in 
the plant ER is virtually 100%. In this respect, the plant ER is a very versatile and efficient 
folding compartment for foreign proteins, as shown by other studies . Similarly to 
mammalian cells, the molecular chaperone BiP is involved in binding unassembled heavy 
chains and it is released upon co-expression of the companion light chains (Nuttall and 
Frigerio, unpublished results). 

Single chains 

Single light chains have a different fate. Plants secrete free or unassembled k chain with 
kinetics typical of 'normal 1 secretory proteins . In contrast with animal cells, where single 
light chains can either be retained in 1he ER, degraded, or secreted as disulfide-bonded 
dimers , unassembled k is quantitatively secreted as a monomer. 

Transport through the Golgi 

With regards with synthesis, translocation into the ER and quality control, the inventors 1 
findings indicate that plants are perfectly happy to produce large amounts of antibody. The 
ER is not stressed, indeed it is able to accept much more heterologous protein. As for 
anterograde transport, their previous observations in transgenic plants showed that 
transport of slgA/G to the vacuole can be inhibited by brefeldin A treatment . Recently, 
however, the inventors have demonstrated that BFA can inhibit vacuolar transport of a 
protein that does not travel through the Golgi . To provide final evidence for the 
involvement of the Golgi in the transport of IgA/G the inventors have therefore analysed 
the state of one of its N- linked glycans. This glycan clearly acquires endoglycosidase H 
resistance after a chase, confirming glycan processing in the Golgi. 

Vacuolar delivery 

For plant -made IgA/G, the problems start post-assembly and, presumably, post-Golgi. A 
large proportion of the molecules, which are expected to be secreted, are actually 
transported to the vacuole where they slowly undergo proteolytic fragmentation. The 
inventors have shown that this is not a stress response from the plant but the result of 
cryptic signalling. 
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C-ter tail sequence and sorting to vacuoles 

It remains to be seen whether this signal is recognised by a low affinity receptor or if 
targeting occurs via different mechanism. The inventors have previously shown that IgA/G 
does not interact with microsomal membranes . At present it is known that IgA/G and 
phaseolin do not compete for sorting (Fig.SB). Whether this is due to different targeting 
mechanisms or to a high capacity of the sorting machinery remains to be established. 

Biotechnological perspectives 

The ultimate goal of secretory antibody expression in plants is to produce large amounts of 
protein in a way that allows for easy purification. Protein secretion by the roots is a most 
promising approach . Maximising SIgA/G secretion is therefore an attractive goal. The 
inventors have identified the signal that prevents a vast proportion of the immunoglobulin 
molecules from being secreted. Deletion of this signal leads to hybrid IgA/G secretion at 
levels comparable to IgG secretion, indicating that no further intracellular processes exist 
to prevent exocytosis. This is the first essential step towards the production of a fully 
secreted complex immunoglobulin. The inventors are aware that complete deletion of the 
C-terminal tail, which also contains the C-terminal cysteine involved in J chain binding, is 
too drastic as it also prevents assembly of the decameric molecule. 

Homology studies of IgM heavy chains show high levels of homology with IgA C-terminal 
ends. Hence, antibodies containing IgM heavy chains are also expected to have similar 
secretion problems. 

The inventors have noted that deletion of the C-terminal end of the heavy chains of IgA 
and IgM results in deletion of the C-terminal cysteine residue responsible for J-chain 
beinding. The presence of the J-chain increases peptidase resistance and stability of the 
antibody. This can be restored by adding an artificial peptide chain to the end of the 
antibody to restore J-chain binding ability. The peptide chain lacks the secondary structure 
encoding the sites that block secretion in native antibody heavy chains. 
Synthetic tailpieces (underlined) which are expected to work in transfected tobacco cells 
include: 

(a) SCMVGHEALPMNFTQKTTORLSGKPACY 

(b) SCMVGHEALPMNFTOKTIDRLSGKPAAACY 

(c) SCMVGHEALPMNFTOKTIDRLSGK PHASTPEPDPVACY 

Ig heavy chains containing synthetic tailpieces will be assayed for their ability to bind J 
chain and to afford secretion of assmebled, multimeric antibodies. This will be initially 
established by transient gene expression in tobacco mesophyll protoplasts. Plasmids 
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encoding modified heavy chains, light chains and J chain were co-transfected into 
protoplasts. Assembly of the (Heavy 2 Light 2 )2J complex and secretion of the complex were 
analysed in vivo by pulse-chase analysis, followed by immunoprecipitation with specific 
antisera, reducing or non-reducing SDS-PAGE and fluorography. 

Figures 9 and 10 show that using the -PAAAAACY tailpiece as the C-terminus end of the 
heavy chain, whilst the efficiency of ennamer assembly of -PAAAAACY is comparable to 
the wild-type IgA/G, recovery of -PAAAAACY in the medium was 2.3-fold higher than 
wild-type IgA/G. 

Further improvements are expected by deletion of the C-tenninal tyrosine and optionally 
replacing it by alanine. 
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Claims 

1. A method of making an antibody molecule, the antibody containing an 
immunoglobulin heavy chain comprising a a3 domain or a mu domain, the method 
comprising: 

(a) Providing a nucleotide sequence encoding the immunoglobulin heavy chain; 

(b) Modifying the nucleotide sequence in the region of the nucleotide sequence 
encoding the C-terminus 18 amino acids of the completed heavy chain to remove, or 
reduce the effectiveness of, one or more vacuolar targeting signal sequences to form a 
modified nucleotide sequence; 

(c) Inserting the modified nucleotide sequence into a host cell; and 

(d) Causing the host cell to express the modified nucleotide sequence to form the 
modified antibody heavy chain and secrete the modified antibody heavy chain from the 
host cell. 

2. A method according to claim 1 wherein the heavy chain molecule is IgA, IgM or an 
IgA/G hybrid. 

3. A method according to claim 1 or claim 2, wherein nucleotide sequence is modified 
by one or more point mutations of the nucleotide sequence, deleting one or more 
nucleotides, adding one or more nucleotides and/or replacing one or more nucleotides with 
a synthetic nucleotide sequence. 

4. A method according to claim 3, wherein the synthetic nucleotide sequence encodes 
an amino acid sequence of general formula: 

-(Xaa,) ra C(Xaa 2 )„ 

where: C = a cysteine residue 

Xaai = independently any amino acid with the proviso that it is not from 
I, L or forms a consecutive sequence Xi X 2 X 3 V S X4 
where: Xi = N, H or L 
X 2 = VorY 
X 3 = SorN 

X4 - aliphatic amino acid 
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Xaa 2 = independently any amino acid 
m = at least 2 
n = Oto5. 

5. A method according to claim 4, wherein Xaa 2 is Y and n = 1. 

6. A method according to any preceding claim, wherein nucleotides encoding the last 
1 6 amino acids of the heavy chain are deleted. 

7. A method according to any preceding claim wherein one nucleotides encoding an 
amino acid sequence having a formula: 

(a) SCMVGHEALPMNFTQKTIDRLSGKPACY, 

(b) SCMVGHEALPMNFTQKTDDRLSGKPAAACY, 

(c) SCMVGHEALPMNFTQKTIDRLSGKPHASTPEPDPVACY or 

(d) SCMVGHEALPMNFTQKTEDRLSGKPAAAAACY 

8. A method according to any preceding claim wherein the nucleotide sequence 
modified originally encoded the amino acid sequence: 

Xi X 2 X 3 V S X4 
where: Xi = N, H or L 

X 2 = VorY 

X 3 = SorN 

X4 = aliphatic amino acid. 

9. A method according to claim 8, wherein the amino acid sequence is: N V S V S V. 

10. A method according to any one of the preceding claims wherein the nucleotide 
sequence modified encoded L or I. 

11. A method according to claim 10, wherein the modified amino acid is one or both of 
an isoleucine 3 amino acids and/or 10 amino acids from the C-terminus end of the 
completed heavy chain. 



12. A method according to any preceding claim, wherein the nucleotide sequence 
modified is within the sequence: 
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PTXiXiXsVS^XsXfiXTXsXgXioXnXuCXn 
where: Xi = N, H or L, preferably N 

X 2 = V or Y, preferably V 

X 3 = SorN 

X4 = an aliphatic amino acid, preferably V or L 
X5 = an aliphatic amino acid, preferably I, V or L 
Xe = M, V or L, especially M 
X 7 = SorA 
X« - EorD 

X9 = any amino acid, preferably G, V, A or T 

X10 = D, E, G or A, preferably D 

Xu = G or S, preferably G 

X12 = I, T, V, Z or A, preferably I or T 

X13 = may or may not be present and, where present is A or Y 

13. A method of adding J-chain binding capability to the heavy chain of an antibody 
comprising the steps of: 

(a) providing a nucleotide encoding an immunoglobulin heavy chain; 

(b) adding to that nucleotide a nucleotide sequence encoding a synthetic tail 
with the amino acid sequence: 

-(XaaOn, C(Xaa 2 )„ 
where: C = Cys 

Xaai is independently any amino acid with the proviso that it is not I or L or 
forms a consecutive sequence Xj X 2 X3 V S X4 (where X1 = N, H or L; X 2 = 
V or Y; X 3 = S or N; X» = aliphatic amino acid) 
Xaa 2 = independently any amino acid 
m = at least 2 
n = 0 to 5; and 

(c) expressing the completed nucleotide in a host cell to form an 
immunoglobulin heavy chain capable of binding J-chain. 

14. A method according to any preceding claim, wherein the host cell is a plant cell. 

15. A method according to claim 14, wherein the plant cell is part of a transgenic plant. 
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16. A method according to any preceding claim additionally comprising the step of 
isolating and purifying the antibody molecule. 

17. A method according to claim 16, wherein the antibody is subjected to a protease 
digest to for Fab or FCab^ fragments. 

18. An antibody obtainable by a method according to any preceding claim. 

19. An antibody containing a heavy chain comprising an a3 domain or a mu domain, 
the cc3 domain or mu domain lacking one or more targeting signals towards the C-tenninal 
end. 

20. An antibody capable of binding J-chain comprising at its C-terminal end the 
sequence: 

-(Xaa,) ro C(Xaa 2 ) n 
where: C = Cys 

Xaai is independently any amino acid with the proviso that it is not I or L or 
forms a consecutive sequence Xi X 2 X* V S X* (where X, = N 3 H or L; X 2 = 
VorY; X3 = SorN; X4 = aliphatic amino acid) 
Xaa 2 = independently any amino acid 
m = at least 2 
n = 0 to 5 

21. An antibody according to claims 18 to 20 which does not contain the targeting 
signal: X,X 2 X 3 VSX4 

where: Xi = N, H or L 
X 2 = VorY 
X 3 = SorN 

X* = aliphatic amino acid. 

22. An antibody according to claim 21, wherein the targeting signal is N V S V S V. 

23. An antibody according to any one of claims 18 to 22 which contains one or no 
isoleucine or leucine amino acids within the last 18 amino acids at the C-terminus of the 
heavy chain of the antibody. 

24. An antibody according to any one of claims 18 to 23 comprising at the C-terminus 
end of the heavy chain of antibody, the sequence: 
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-(Xaa 1 ) m C(Xaa 2 ) n 

where: C = cysteine residue 

Xaai = independently any amino acid with the proviso that it is not I or L 
or forms a consecutive sequence Xi X2 X3 V S X4 
where: X» = N, HorL 
X 2 = VorY 
X 3 - SorN 

X4 = aliphatic amino acid 
Xaa 2 = independently any amino acid 
m = at least 2 
n = 0 to 5. 

25. An antibody according to any preceding claim in which at least two, preferably two 
to four, glycine or alanine residues are present downstream of a C-terminal targeting 
sequence 

26. An antibody according to any preceding claim in which at least the terminal amino 
acid residue of a C-terminal targeting sequence is replaced by at least two, preferably two 
to four, glycine or alanine residues. 

27. Use of an isolated antibody according to any one of claims 18 to 26 in the 
manufacture of a medicament to treat disease. 



28. A method of treating a disease by administering an antibody according to any one 
of claims 1 8 to 26 to a patient. 

29. A method of prophylaxis, comprising administering an antibody according to any 
one of claims 18 to 26 to a person or animal. 

30. A vector comprising a nucleotide sequence encoding an antibody according to any 
one of claims 1 8 to 26. 

31. A host cell comprising a nucleotide sequence encoding antibody according to any 
one of claims 18 to 26 or a vector according to claim 30. 

32. A transgenic plant comprising a nucleotide encoding an antibody according to any 
one of claims 18 to 26. 
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33. An immunoassay comprising an antibody as defined in any one of claims 1 8 to 26 
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SEQUENCE LISTING 

<110> University of Warwick 

<120> Antibody Secretion 

<130> P708122GB 

<140> GB0226787.7 

<141> 2002-11-18 

<160> 69 

<170> Patentln version 3.2 

<210> 1 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 
<220> 

<221> MI S COFEATURE 

<222> (1)..(1) 

<223> Asn, His or Leu 

<220> 

<221> MIS C_FEAT URE 

<222> (2).. (2) 

<223> Val or Tyr 

<220> 

<221> MIS COFEATURE 

<222> (3).. (3) 

<223> Ser or Asn 

<220> 

<221> MI S COFEATURE 

<222> (5).. (5) 

<223> An aliphatic amino acid, especially Val or Leu 
<220> 

<221> misc_feature 

<222> (6).. (6) 

<223> Xaa can be any naturally occurring amino acid 

<400> 1 

Xaa Xaa Xaa Val Ser Xaa 
1 5 

<210> 2 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 

<400> 2 
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Asn Val Ser Val Ser Val 



<210> 3 

<211> 3 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 



<220> 

<221> MI SC_FEATURE 
<222> (1)..(1) 

<223> m copies of, independently, any amino acid with the proviso that 
it is not selected from He, Leu or forms the consecutive 
sequence described in SEQ ID: 4. m is an integer- of at least 2. 
Preferably, m is between 4 and 20, especially 5 to 10. 

<220> ■ 

<221> MI SC_FEATURE 
<222> (3).. (3) 

<223> n copies of any amino acid, the presence of each amino acid is 
independent of the other (s). n is an integer of 0 to 5. 
Preferably n= 0 or 1, especially 1. This amino acid is most 
preferably Tyr or Ala, especially Ala. Preferably 

<400> 3 

Xaa Cys Xaa 
1 



<210> 4 

<211> 6 

<212> PRT 

<213> Artificial 

<220> — ' 

<223> Definition of X at position 1 in SEQ ID: 3 



<220> 

<221> MISC_FEATURE 

<222> (1)..(1) 

<223> Asn, His or Leu 

<220> 

<221> MIS C_FEAT URE 

<222> (2).. (2) 

<223> Val or Tyr 

<220> 

<221> MIS COFEATURE 

<222> (3).. (3) 

<223> Ser or Asn 

<220> 

<221> MIS C_FE AT URE 

<222> (4).. (4) 

<223> Any aliphatic amino acid 
<220> 
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< 2 2 1 > mis c_f e a t ur e 
<222> (6).. (6) 

<223> Xaa can be any naturally occurring amino acid 
<400> 4 

Xaa Xaa Xaa Val Ser Xaa 
1 5 



<210> 5 

<211> 18 

<212> PRT 

<213> Artificial 

<220> • • 

<223> Modified targeting signal in the final antibody heavy chain 



<220> 

<221> MI S C_FEATURE 

<222> (3) . . (3) 

<223> Asn, His or Leu. Preferably Leu 
<220> 

<221> MIS COFEATURE 

<222> (4). .(4) 

<223> Val or Tyr, preferably Val 
<220> 

<221> MI S C_FEATURE 

<222> (5).. (5) 

<223> Ser or Asn 

<220> 

<221> MI SC_FEATURE 

<222> (8).. (8) 

<223> An aliphatic amino acid, preferably Val or Leu 
<220> 

<221> MI SC_FEATORE 

<222> (9) . . (9) 

<223> An aliphatic amino acid, preferably lie, Val or Leu 
<220> 

<221> MI SC_FEATURE 

<222> (10).. (10) 

<223> Met, Val or Leu, preferably Met 
<220> 

<221> MISC_FEATURE 

<222> (11).. (11) 

<223> Ser or Ala 

<220> 

<221> MI SC_FEATURE 

<222> (12).. (12) 

<223> Asp or Glu 

<220> 

<221> MI SC_FEATURE 

<222> (13).. (13) 

<223> Any amino acid, preferably Gly, Val, Ala or Thr 



<220> 
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<221> MISC_FEATORE 

<222> (14).. (14) 

<223> Asp, Glu, Gly, or Ala, preferably Asp 
<220> 

<221> MISC_FEATURE 

<222> (15).. (15) 

<223> Gly or Ser, preferably Gly 

<220> 

<221> MISC_FEATURE 

<222> (16).. (16) 

<223> lie, Thr, Val, Glx or Ala, preferably lie or Thr 
<220> 

<221> MIS COFEATURE 

<222> (18).. (18) 

<223> May or may not be present and where present is Ala or Tyr. Most 
preferably, where this amino acid present it is Ala. 

<400> 5 

Pro Thr Xaa Xaa Xaa Val Ser Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa Xaa 
15 10 15 



Cys Xaa 



<210> 6 

<211> 6 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 



<220> 

<221> MIS COFEATURE 

<222> (1)..(1) 

<223> Asn, His or Leu 

<220> 

<221> MIS COFEATURE 

<222> (2).. (2) 

<223> Val or Tyr 

<220> 

<221> MISC__FEATURE 

<222> (3).. (3) 

<223> Ser or Asn 

<220> 

<221> MIS COFEATURE 

<222> (6).. (6) 

<223> Aliphatic amino acid 



<400> 6 



Xaa Xaa Xaa Val Ser Xaa 
1 5 
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<210> 7 

<211> 28 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 
<40d> 7 

Ser Cvs Met Val Gly His Glu Ala Leu Pro Met Asn Phe Thr Gin Lys 
15 10 15 



Thr He Asp Arg Leu Ser Gly Lys Pro Ala Cys Tyr 
20 25 



<210> 8 

<211> 30 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 

<400> 8 

Ser Cys Met Val Gly His Glu Ala Leu Pro Met Asn Phe Thr Gin Lys 
1 5 10 15 



Thr He Asp Arg Leu Ser Gly Lys Pro Ala Ala Ala Cys Tyr 
20 25 30 



<210> 9 

<211> 38 

<212> PRT 

<213> Artificial 

<220> 

<223> Modified targeting signal in the final antibody heavy chain 

<400> 9 

Ser Cys Met Val Gly His Glu Ala Leu Pro Met Asn Phe Thr Gin Lys 
1 . 5 10 15 



Thr He Asp Arg Leu Ser Gly Lys Pro His Ala Ser Thr Pro Glu Pro 
20 25 30 



Asp Pro Val Ala Cys Tyr 

35 • 



<210> 10 

<211> 27 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic oligonucleotide 
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<400> 10 

ccatcgatgg aatggacctg ggttttt 

<210> 11 

<211> 27 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 

<400> 11 97 
ccctctagac tagtagcata ggccatc 



<210> 12 

<211> 31 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic oligonucleotide 
<400> 12 

actgtagaca attccgccac ctcagcctac a 



<210> 13 

<211> 31 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 

<400> 13 

tgtaggctga ggtggcggaa ttgtctacag t 



<210> 14 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 

<400> 14 

gagcagctca acagcgtttt ccgctcagtc ag 



<210> 15 

<211> 32 

<212> DNA 

<213> Artificial 

T 

<220> 

<223> Synthetic oligonucleotide 



<400> 15 

ctgactgagc ggaaaacgct gttgagctgc tc 



32 



WO 2004/046190 PCT/GB2003/004983 

7/19 



<210> 16 

<211> 32 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic oligonucleotide 

<400> 16 

ttgcccatga acttcgtcca gaagaccatc ga 32 

<210> 17 

<211> 32 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 

<400> 17 

tcgatggtct tctggacgaa gttcatgggc aa 32 

<210> 18 

<211> 33 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 

<400> 18 

aaacccacca atgtcgctgt gtctgtgatc atg 33 



<210> 19 

<211> 33 

<212> DNA 

<213> Artificial 



<220> 

<223> Synthetic oligonucleotide 
<400> 19 

catgatcaca gacacagcga cattggtggg ttt 

<210> 20 

<211> 30 

<212> DNA 

<213> Artificial 

<220> 

<223> Synthetic oligonucleotide 
<400> 20 

ccctctagac tatttacccg acagacggtc 



<210> 21 

<211> 32 

<212> DNA 

<213> Artificial 
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<220> 

<223> Synthetic oligonucleotide 

<400> 21 

gagcagctca acagcgtttt ccgctcagtc ag 32 

<210> 22 

<211> 8 

<212> PRT 

<213> Artificial 

<220> 

<223> Tailpiece of artificial C-terminus of heavy chain 

<400> 22 

Pro Ala Ala Ala Ala Ala Cys Tyr 
1 5 



<210> 


23 


<211> 


40 


<212> 


PRT 


<213> 


Mus 


<400> 


23 



Cys Met Val Gly His Glu Ala Leu Pro Met Asn Phe Thr Gin Thr lie 
15 10 15 



Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie Met 
20 25 30 



Ser Glu Gly Asp Gly lie Cys Tyr 
35 40 



<210> 24 

<211> 23 

<212> PRT 

<213> Mus musculus 

<400> 24 

Cys Ser Val Leu His Glu Gly Leu His Asn His His Thr Glu Lys Ser 
1.5 10 15 



Leu Ser His Ser Pro Gly Lys 
20 



<210> 25 

<211> 25 

<212> PRT 

<213> ALC_MOUSE 

<400> 25 

lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
1 5 10 15 
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Met Ser Glu Gly Asp Gly lie Cys Tyr 
20 25 



<210> 


26 


<211> 


25 


<212> 


PRT 


<213> 


Q9DCD9 


<400> 


26 



lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
1 5 10 15 



Met Ser Glu Gly Asp Gly lie Cys Tyr 
20 25 



<210> 27 

<211> 25 

<212> PRT 

<213> Q91WP5 

<400> 27 

lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
15 10 15 



Met Ser Glu Gly Asp Gly lie Cys Tyr 
20 25 



<210> 28 

<211> 25 

<212> PRT 

<213> Q99M22 

<400> 28 

lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
15 10 15 



Met Ser Glu Gly Asp Gly lie Cys Tyr 
20 " 25 



<210> 29 

<211> 25 

<212> PRT 

<213> Q91XE1 

<400> 29 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 
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<210> 30 

<211> 25 

<212> PRT 

<213> AAH29188 

<400> 30 

lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 31 

<211> 25 

<212> PRT 

<213> Q91WT1 

<400> 31 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 ~ 25 



<210> 32 

<211> 25 

<212> PRT 

<213> Q91WT3 

<400> 32 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
1 5 10 -15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 33 

<211> 25 

<212> PRT 

<213> Q8VCV5 

<400> 33 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 34 

<211> 25 

<212> PRT 

<213> Q91X92 
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<400> 34 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 35 

<211> 25 

<212> PRT 

<213> AAH28249 

<400> 35 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 "15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 36 

<211> 25 

<212> PRT 

<213> Q8VEA0 

<400> 36 

lie Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 37 

<211> 25 

<212> PRT 

<213> Q99LA6 

<400> 37 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 


38 


<211> 


25 


<212> 


PRT 


<213> 


Q91Z07 


<400> 


38 



He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 
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Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 



<210> 39 

<211> 25 

<212> PRT 

<213> Q99KA4 



<400> 39 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
1 5 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 " 25 



<210> 40 

<211> 25 

<212> PRT 

<213> Q91WR1 

<400> 40 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val He 
15 10 15 

Met Ser Glu Gly Asp Gly lie Cys Tyr 
20 25 



<210> 41 

<21-1> 25 

<212> PRT 

<213> AAH31703 

<400> 41 

He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val Ser Val lie 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
• 20 25 



<210> 


42 


<211> 


25 


<212> 


PRT 


<213> 


Q8VCX4 


<400> 


42 



He Asp Arg Leu Ser Gly Lys Pro Thr Asn Val Ser Val " Ser Val He 
15 10 15 



Met Ser Glu Gly Asp Gly He Cys Tyr 
20 25 
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<210> 43 

<211> 25 

<212> PRT 

<213> ALC2_HUMAN 

<400> 43 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
1 " " 5 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 44 

<211> 25 

<212> PRT 

<213> ALCl_GORGO 

<400> 44 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 45 

<211> 25 

<212> PRT 

<213> ALC INHUMAN 

<400> 45 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 46 

<211> 25 

<212> PRT 

<213> Q9UP60 

<400> 46 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 47 

<211> 25 

<212> PRT 

<213> Q9NPP6 
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<400> 47 

lie Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
1 5 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 * 25 



<210> 48 

<211> 25 

<212> PRT 

<213> BAC11114 

<400> 48 

lie Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 49 

<211> 25 

<212> PRT 

<213> Q96K68 

<400> 49 

lie Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
1 5 .10 15 

Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 50 

<211> 25 

<212> PRT 

<213> Q96KX8 

<400> 50 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 

Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 51 

<211> 25 

<212> PRT 

<213> Q96DK0 

<400> 51 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 
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Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 52 

<211> 25 

<212> PRT 

<213> Q8WY24 

<400> 52 



He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
1 5 - 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 53 

<211> 25 

<212> PRT 

<213> AAH32249 

<400> 53 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
1 5 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 54 

<211> 25 

<212> PRT 

<213> Q9BRV0 

<400> 54 

He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Vai 
1 5 10 15 



Met Ala Glu Val Asp Gly Thr Cys Tyr 
20 25 



<210> 


55 


<211> 


25 


<212> 


PRT 


<213> 


ALC 


<400> 


55 



He Asp Arg Leu Ala Gly Lys Pro Thr His Val Asn Val Ser Val Val 
15 10 15 



Val Ala Asp Val Glu Ala Val Cys Tyr 
20 25 
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<210> 56 

<211> 25 

<212> PRT 

<213> MUSJtfESAU 

<400> 56 

Val Asp Arg Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu lie 
1 5 10 15 



Met Ser Asp Ala Gly Gly Thr Cys Tyr 
20 25 



<210> 57 

<211> 25 

<212> PRT 

<213> MUCJ40USE 

<400> 57 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu He 
15 10 15 



Met Ser Asp Thr Gly Gly Thr Cys Tyr 
20 25 



<210> 58 

<211> 25 

<212> PRT 

<213> Q9BSZ1 

<400> 58 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
15 10 15 



Met Ser Asp Thr Ala Gly Thr Cys Tyr 
20 i 25 



<210> 59 

<211> 25 

<212> PRT 

<213> MUCBJiUMAN 

<400> 59 



Val, Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 



Met Ser Asp Thr Ala Gly Thr Cys Tyr 
20 25 



<210> 60 

<211> 25 

<212> PRT 

<213> MUC HUMAN 
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<400> 60 

Val Asp Lys Sfer Thr. Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 



Met Ser Asp Thr Ala- Gly Thr Cys Tyr 
20 25 



<210> 61 

<211> 25 

<212> PRT 

<213> Q9BQB8 

<400> 61 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 



Met Ser Asp Thr Ala Gly Thr Cys Tyr 
20 25 



<210> 62 

<211> 25 

<212> PRT 

<213> Q9BU10 

<400> 62 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
15 10 15 



Met Ser Asp Thr Ala Gly Thr Cys Tyr 
20 ~ 25 



<210> 63 

<211> 25 

<212> PRT 

<213> Q96BB9 

<400> 63 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 



Met Ser Asp Thr Ala Gly Thr Cys Tyr 
20 25 



<210> 64 

<211> 25 

<212> PRT 

<213> MUC_CANFA 

<400> 64 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 
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Leu Ser Asp Thr Ala Gly Thr Glx Tyr 
20 " 25 



<210> 65 

<211> 25 

<212> PRT 

<213> MUC SUNMU 



<400> 65 

Val Asp Lys Thr Ser Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu Val 
1 5 10 15 



Leu Ser Asp Thr Ala Ser Thr Cys Tyr 
20 25 



<210> 66 

<211> 25 

<212> PRT 

<213> MUC_RABIT 

<400> 66 

Val Asp Lys Ser Thr Gly Lys Pro Thr Leu Tyr Asn Val Ser Leu He 
1 5 10 15 



Met Ser Asp Thr Ala Ser Thr Cys Tyr 
20 25 



<210> 67 

<211> 25 

<212> PRT 

<213> M0C_CHICK 

<400> 67 

Val Asp Arg Ala Ser Gly Lys Ala Ser Ala Val Asn Val Ser Leu Val 
1.5 10 15 



Leu Ala Asp Ser Ala Ala Ala Cys Tyr 
20 25 



<210> 68 

<211> 22 

<212> PRT 

<213> Q8YOT9 

<400> 68 

Arg He Gly Gly Pro Pro Thr Gly He Thr Ser Asp Val Tyr Leu Ser 
15 10 * 15 



Val Tyr Glu Gly Val Cys 
20 
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<210> 69 

<211> 32 

<212> PRT 

<213> Artificial 



<220> 

<223> Modified targeting signal in the^ final antibody heavy chain 
<400> 69 

Ser Cys Met Val Gly His Glu Ala Leu Pro Met Asn Phe Thr Gin Lys 
15 10 15 



Thr lie Asp Arg Leu Ser Gly Lys Pro Ala Ala Ala Ala Ala Cys Tyr 
20 25 30 



